The invalidity of a strong capacity for a quantum channel with memory 
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The strong capacity of a particular channel can be interpreted as a sharp limit on the amount of 
information which can be transmitted reliably over that channel. To evaluate the strong capacity 
of a particular channel one must prove both the direct part of the channel coding theorem and 
the strong converse for the channel. Here we consider the strong converse theorem for the periodic 
quantum channel and show some rather surprising results. We first show that the strong converse 
does not hold in general for this channel and therefore the channel does not have a strong capacity. 
Instead, we find that there is a scale of capacities corresponding to error probabilities between 
integer multiples of the inverse of the periodicity of the channel. A similar scale also exists for the 
random channel. 

PACS numbers: 03.37.Hk, 89.70.Kn 

Keywords: channel coding theorem, classical capacity, quantum channels with memory 



I. INTRODUCTION 

The full channel coding theorem provides a limit on 
the rate at which a sender can communicate an encoded 
message to a receiver, such that the probability of a de- 
coding error at the receiver's side decays exponentially in 
the number of channel uses. The theorem is comprised 
of two parts: the direct part of the theorem, which refers 
to the construction of the code, and the converse to the 
theorem. The direct part of the quantum channel coding 
theorem states that using n copies of the channel, we can 
code with an exponentially small probability of error at 
a rate R = i log \M\, provided R < C in the asymptotic 
limit, where M. denotes the set of possible codewords 
to be transmitted over the channel and C denotes the 
capacity of the channel. If the rate at which classical in- 
formation is transmitted over a quantum channel exceeds 
the capacity of the channel, i.e. if R > C, then the prob- 
ability of decoding the information correctly goes to zero 
in the number of channel uses. The latter is known as the 
strong converse to the channel coding theorem. The weak 
converse, on the other hand, states that if R > C, then 
the probability of decoding the information correctly is 
bounded away from 1, i.e. the error probability does 
not tend to zero, whatever encoding/decoding scheme is 
used. 

Shannon [l[ first proposed the theorem for classical dis- 
crete memoryless channels and the first rigorous proof of 
the direct part of the theorem was provided by Feinstein 
and the strong converse by Wolfowitz 



* dorlas@stp.dias.ie 
t cqtciara@nus.edu.sg 



However, it was observed that the existence of the strong 
converse, and therefore strong capacity, for other types of 
classical channels does not always hold [4( . See Ahlswede 
J5j for a more complete discussion of converse results for 
various types of classical channels. 

The strong converse to the channel coding theorem 
for memoryless classical- quantum channels with product 
state inputs was determined independently by Winter Q 
and by Ogawa and Nagaoka Q . Their result implies that 
every memoryless discrete classical-quantum channel has 
a strong capacity which provides a sharp upper-bound 
on the rate at which classical information can be trans- 
mitted over this type of channel using product states. 
Recent results include a proof by Bjelakovic and Boche 
@, Q of a full coding theorem for the discrete memory- 
less compound classical-quantum channel. Wehner and 
Konig 10] proved the fully general strong converse theo- 
rem for a family of channels, that is, they proved that the 
strong converse theorem holds for a family of quantum 
channels even in the case when entangled state inputs are 
allowed. 

In this article we relax the assumption that the communi- 
cation channel in question is memoryless and we concen- 
trate on a particular quantum channel with memory, that 
is, a channel with correlations between successive channel 
uses. In our case the correlations between successive uses 
of the channel can be described by a Markov chain. Com- 
munication channels with memory are widely considered 
to be more realistic than memoryless channels since real- 
world channels may not exhibit independence between 
successive errors and correlations are common. Noise 
correlations are also necessary for certain models of quan- 
tum communication [111 ]. See for example Kretschmann 
and Werner [12j | and Mancini [131 ] for models of quantum 
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memory channels. 

The article is organised as follows. We introduce nota- 
tion, necessary definitions and define the quantum peri- 
odic channel in Section |TTJ In Section UTTI we prove that 
the periodic channel does not have a strong capacity. The 
observation relies on a result which is proved in Appendix 
[A1 involving a particular instance of a periodic channel 
and consequently the strong converse does not hold in 
general for the periodic channel. In Section [V] we remark 
on a scale of capacities for the random channel. We then 
state and prove the main result involving a scale of ca- 
pacities for the channel. 

Note that log is understood to be taken to the base 2 
throughout the article. 



II. PRELIMINARIES 

We begin by introducing some notation. A memoryless 
channel is given by a completely positive trace-preserving 
(CPT) map $ : B{H) B{K), where B(H) and B(K) de- 
note the states on the input and output Hilbert spaces 
H and IC, respectively. 

Equivalently, we can describe a classical-quantum chan- 
nel, here also denoted f , as a mapping from the classical 
message to the output state of the channel on B(1C) as 
follows, 



$:Afi-> B(K), 



(2.1) 



where the message is first encoded into a sequence belong- 
ing the set X n , where X represents the input alphabet. 
We can combine the two mapping descriptions as fol- 
lows. We wish to send classical information in the form of 
quantum states over a quantum channel $. A (discrete) 
memoryless quantum channel, $, carrying classical infor- 
mation can be thought of as a map from a (finite) set, or 
alphabet, X into B(IC), taking each x £ X to $> x = Q(p x ), 
where the input state to the channel is given by {p x }x£X 
and each p x e B{%). Let d = dim('H) and a = \X\. 
For a probability distribution P on the input alphabet 
X, the average output state of a channel $ is given by 



Pa = J2 P&MPx)- 



(2.2) 



x£X 



The conditional von Neumann entropy of $ given P is 
defined by 



= J2 P(x)S(HPx)), 



(2.3) 



x£X 



and the mutual information between the probability dis- 
tribution P and the channel $ is defined as follows, 



J(P;$) = S(Pa) - S($|P). 



(2.4) 



An n-block code for a quantum channel $ is a pair 
(C™, E n ), where C n is a mapping from a finite set of mes- 
sages M, of length n, into X n , i.e. a sequence x n € X is 



assigned to each of the \A4 \ messages, and E n is a POVM, 
i.e. a quantum measurement, on the output space lC® n 
of the channel . The maximum error probability of 
the code (C n ,E n ) is defined as 

Pe (C n ,E n ) = max{l-TJ:($£» (m)J E£) : m e M}. (2.5) 

The code (C n ,E n ) is called an (n, A)-code, if 
p e (C n ,E n ) < A. The maximum size lA^I of an (n, A)- 
code is denoted iV(n, A). Define an finite alphabet X 
and sequences x n — x±, . . . , x n £ X n and let 



N(x\x n ) = \{i E {1, . . . ,n} : Xi = x}\ 



(2.6) 



for x £ X. The type of the sequence x n is given by the 
empirical distribution P x n on X such that 



P xn (x) 



N(x\x n ) 



(2.7) 



Clearly, the number of types is upper bounded by (n+l) a , 
where a = \X\. 



A. Coding theorem and strong converse 

The strong capacity of a particular channel provides 
a sharp threshold on the rate at which information may 
be transmitted over that channel with exponentially de- 
creasing probability of decoding error in the number of 
channel uses. In order to establish a strong capacity for 
a particular channel one must prove both existence of a 
capacity achieving code and the strong converse. 
The direct part of the coding theorem for memoryless 
quantum channels with product-state inputs was deter- 
mined independently by Holevo [14f and Schumacher and 
Westmoreland [l5j]. Winter @ and Ogawa and Nagaoka 
Q independently proved the strong converse for memo- 
ryless quantum channels. 

In Section IIIII we require a version of the strong con- 
verse theorem proved by Winter [6| which holds for a 
single codeword type. We therefore provide this version 
fLemma III. II) below, following both the direct part and 
strong converse theorems for memoryless classical quan- 
tum channels as stated and proved in Q. 

Theorem II. 1 (Direct part) 

For all A € (0, 1) and 5 > there exists no (A, 6) <E N such 
that for all n > hq and every classical quantum channel 
$ and probability distribution P on X , there exists an 
{n,X)-code such that the number of messages satisfies 



\M n \ > 2"(x*(i , )-<5) ; 

where the Holevo capacity \* is given by 

**(<&)= sup /(P;$) 
p 



(2.8) 



(2.9) 



the supremum being over all probability distributions P 
on X. 
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Theorem II. 2 (Strong converse) 

For all A £ (0,1) and all 5 > there exists ni(X,S) 
such that for all n > n\ and every memoryless classical 
quantum channel $ and the number of messages of an 
(n, X)-code is bounded by 



\M n \ < 2"(x*(*)+< 5 ). 



(2.10) 



Remark. Winter in fact proved a stronger version of 
these theorems in which S is replaced by a constant times 

In the following we follow the approach of Winter ( [6j , 
Theorem 13) in which the strong converse is derived from 
a bound on the number of codewords of a given type P: 

Lemma II. 1 (Single- type strong converse) 
For A G (0, 1) and S > there exists ni(X,S) such that 
for n > ni, every (n, A) -code for which all codewords are 
of the same type P , 



\M 



n,P 



< 2 



n(I(P; <S>)+5) 



(2.11) 



The strong converse follows immediately from this 
lemma using the fact that the number of types is upper 
bounded by (1 -I- n) a (see [l(| Lemma 2.2). 

Remark. In contrast to the strong converse where the 
decoding error goes to 1 exponentially in the number of 
channel applications if R > C , the weak converse states 
that if R > C , then the probability of decoding the infor- 
mation correctly is bounded away from 1 . 



B. Quantum channels with classical memory 

Next, we provide definitions needed to describe quan- 
tum channels with classical memory [13] • Let / denote 
a countable set and let A; = ¥(X — i), where X is a 
random variable taking values in the state space /. Let 
Q denote a transition matrix, with entries labeled qju. A 
discrete time random process denoted X n can be consid- 
ered to be a Markov chain with transition matrix Q and 
initial distribution A, if and only if the following holds 
for i , ■ . . E I, 



P (X = i ,X 1 

= ^io ( li 1 \i Qi2\ii 



- i\, . , . , X n -i 
9i„_i |i„-2 ■ 



in-l) 



(2.12) 



In [18] Datta and Dorlas analyse a quantum channel of 
length n with Markovian noise correlations, first defined 
by Bowen and Mancini [ljj, as follows 



* b (p b )=X>- 



>*<n-x)(P B ) 



(2.13) 

where qju are the elements of the transition matrix of a 
discrete-time Markov chain, and {A^} represents an in- 
variant distribution on the Markov chain. 
In Section [IIII we analyse a particular channel with clas- 
sical memory, namely the periodic channel. We describe 



this channel below. 

A periodic channel acting on an n-fold input state can 
be described as follows 



1 



L-l 



i=0 



i+1 



(2.14) 

where $i are CPT maps acting on the same Hilbert 
space and the index is cyclic, modulo the period L, i.e. 

= <Jv In this case the elements of the correspond- 
ing transition matrix are given by qj\i — Oij, where 



J h3 



if j = i + 1 
otherwise. 



mod L 



(2.15) 



The product-state capacity of the channel, denoted C p is 
given by 



L-l 



C p ($) = i S up^/(P;$ i ). 



(2.16) 



i=0 



The proof of direct part of the channel coding theorem for 
the periodic quantum channel is provided in Appendix B 
of [20|| . This is in fact a special case of the main result 
proved by Datta and Dorlas in [lj| . Note that the proof 
of the direct part of the coding theorem for this channel 
makes use of a preamble to the code which the receiver 
uses upon receipt to determine which branch of the chan- 
nel was selected. 

Another channel of the general type (I2.13P is the ran- 
dom channel. It is given by 



A I 



(2.17) 



where $^ (i = 1, . . . , M) are CPT maps acting on the 
same Hilbert space and q± , . . . , qM is a probability dis- 
tribution. In this case the elements of the corresponding 
transition matrix are given by q^ — Sij. It was shown 
in [2l| that the product state capacity of this channel is 
given by 



M 

C p (<&) = supmin/(P; $,). 

p i—l 



(2.18) 



We will remark on this channel, which like the periodic 
channel has long-term memory, in Section fVl 



III. CHANNEL WITHOUT A STRONG 
CONVERSE 

The strong converse for the periodic quantum channel 
does not hold in general because the following inequality 
holds 



(3.19) 
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where, 

— 1 L_1 

C P = - Vsup/(P,$,). (3.20) 
L tt p 

The strict inequality above can be shown explicitly for 
a periodic channel consisting of two branches of qubit 
amplitude-damping channels (see Appendix [X] below for 
detailed proof). On the other hand, equality for expres- 
sion (|3.19p can be shown to hold for a periodic channel 
with depolarising channel branches (22j . 

Let us now investigate whether we can prove a full 
coding theorem for rates R such that 

C P <R<CT P . (3.21) 

We first define the average probability of error as follows 

L-l 

Te = T Y,Pe<^ ( 3 - 22 ) 

i=0 

where p\ denotes the probability of error for the i-th 
channel branch. 

Our coding strategy is as follows. We choose a code 
i.e. a particular encoding and decoding scheme, suit- 
able for a particular channel branch labeled by the index 
i € {0, . . . , L— 1}. Here a 'branch' is defined as one term 
in the sum (|2.14l) i.e. $| n) = $j <g) <g> • • • <g> $ i+n _i. 
According to the coding theorem for memoryless chan- 
nels, there is a code with error probability tending to 
zero for this branch with rate R. Indeed, for each j there 
exists a probability distribution Pj of states optimising 
Xj = supp I(P; $j) and we can choose states from a 
typical subspace for these distributions, which can be in- 
terlaced at the positions j ~ i + kL, where fee — 1]. 
The probability of choosing a particular branch correctly 
is given by 4 and therefore the probability of error ap- 
proaches 

Pe = < 1. (3.23) 

We thus have a A-code for all A > 1 — In particular, 
the error probability is bounded away from 1, and the 
strong converse does not hold. 

On the other hand the strong converse does hold for 
R > C p . Indeed, the codewords can be decomposed into 
sub-codewords corresponding to the different stages of a 
period: x n — (xq , . . . , x^-i)' wnere the components of 
the x™ are understood to be interlaced in x n . We distin- 
guish types Po, . . . ,Pl-i for the sub-codewords. Then 
we have an analogue of the single-type strong converse 
given by Lemma III. II 

Lemma III.l For A G (0,1) and 6 > there exists 
ni(A, S) such that for n > n\, every (n, A) -code for which 
all sub-codewords are of the same type Po, . . . , Pe,-i, 
given that the i-th branch is selected, 

|^„,p ,... a _ 1 | < 2*E^Wfl.;»^)+fl. (3.24) 



Clearly, for the complete channel, it follows that the 
number of codewords such that the sub-codewords are 
of types Po, . . . , Pl_i, satisfies 

|>kPo,...A-il < 2T^=o( su Pp'(^+fc)+ 5 ). (3.25) 

Summing over the types, we obtain the strong converse. 

We can conclude that the strong converse holds for 
rates R> C„. 



IV. A SCALE OF CAPACITIES 

The above obviously raises the question if smaller error 
probabilities can be attained for smaller rates, but still 
above C p . For this, we define a 'pair capacity' C p 2 ^ as 
follows: 

1 L_1 

ZL 0<ii<i2<£ * — ' p 
k=Q 

(4.26) 

Suppose the maximum is attained at a certain pair 
*a)< With probability 2/L, one of the two branches 
it or «2 is chosen. We attach a preamble to the code as 
in the proof of the product-state capacity of the periodic 
channel (|2.16l) . If, for example, the branch i% is selected 
by the channel, the receiver can determine that this is 
the case by measuring the preamble, and can then choose 
states for each value of k from the typical space corre- 
sponding to the maximising distribution P^ for the CPT 
map This constitutes an encoding with rate given 

by the average of the mutual informations I(P;<f>i 1+ k) 
for k = 0, . . . , L— 1, which is greater than or equal to the 
pair capacity C ^ given by Equation (|4.26l) . We have 
thus constructed a A-code for A > 1 — j-. 

(2") 

On the other hand, let R > C p and suppose that 
(C n ,E n ) is a sequence of (n, A)-codes with A < 1 — ^, 
and assume that 

-\og\M n \ >R>CW. (4.27) 

n 

First note that we may assume that the number of 
codewords with sub-codewords of types Po,...Pl-i is 
bounded by 

1 1 

-logl-Mn.fl,,...,^! < T y;(J(i\, «*+*) + *) (4.28) 

k=Q 

for some fixed i = 0, . . . , L — 1. Indeed, otherwise, by 
Lemma riILl| p\ > A for all i and hence p e > A. 

We now claim that for every other j / i, and e > 
small enough, 

1 1 

k=0 
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If this were not the case then the pair capacity for a 
single type P k can be written as 



L-l 



iloglMMv.^J < J^(/(P fe ,$ l+fe ) 



fc=0 



I(P k ,$ j+k )+6) (4.30) 



and hence 

ilogl-M^fb,...,^! < iy"(sup{I(P,$ i+ ife) 
2L ^o p 
+ 7(P,^ +& )} + <5). (4.31) 



L-l 



Summing over the types Pq, . . . , Pl-i leads to a contra- 
diction with (|4727j) . 

Now, expression (|4.29l) implies with Lemma Till. 1 1 that . 
if the j-th branch is selected by the channel, then the 
error probability p 3 e > 1 — r\ for any r\ > 0. Since with 
probability 1 — j- one of the branches j other than i 
is selected, we conclude that the error probability p e > 
(l — j-) (1 — ?/) > A if 77 < 1 — — A is small enough. 

It is now clear that this argument can be generalised 
to prove: 

Theorem IV. 1 Define, for r = 1, . . . , L a scale of ca- 

(r) 

pacities C p by 

L-l r 

C« = — max V sup V I(P; $ im+fc ). 
p rL o<n< - <i r <L ^ P ^ 

(4.32) 

(Note that C p 1] = C p and C p L) = ~C~ V .) Then, if A > 
1 — and R < C p r \ there exists a sequence of (n, A)- 
codes with rate R. Conversely, if X < 1 — *-t^-, there 
exists no sequence of (n, X) -codes with rate R > C p r ^ . 

V. THE RANDOM CHANNEL 



For the converse to the theorem, we introduce another 
scale as follows: 



= sup max J(P; 

p ieA 



(5.35) 



Then, if X < 1 — q(A) there exist no (n, X)-codes with 



rate R>C£. 



The situation is less clear-cut than it seems, however. 
In fact, not every q(A) is necessarily a point of discon- 
tinuity for the capacity, because is in general not 
monotonic in the probabilities 9(A)! 



VI. DISCUSSION 

One of the most surprising and interesting results 
which has emerged from Shannon Theory is the obser- 
vation that the strong information-carrying capacity of a 
memoryless channel is independent of the upper bound 
on the maximum error probability of that channel, usu- 
ally denoted A. The independence of the parameter A is 
crucial to the existence of a so-called strong capacity for 
the channel 

The dependency of some channel capacities on this pa- 
rameter A, including non-stationary discrete memoryless 
classical channels, led to the definition of a capacity func- 
tion Note that recently Ahlswede [23| proved that 
the capacity functions can now be thought of as so-called 
capacity-sequences. 

For the case of the quantum periodic channel, and also 
the random channel, we have shown that an analogous 
parameter-dependent capacity can be defined, which 
takes the form of a scale of capacities applicable for var- 
ious ranges of the error parameter. 

Note. It appears that similar results to ours were ob- 
tained by Datta, Hsieh and Brandao [2J|, using different 
methods. 



The situation for the random channel is similar, but 
more complicated due to the fact that different branches 
can have different probabilities qi. We can in general 
distinguish break points at values of the error probability 
given by 



?(A) = £>, Ac{l,...,M}. 



(5.33) 



We have an analogue of the detailed theorem for periodic 
channels above: 

Theorem V.l Define, for A C {1, . . . , M} a scale of 

capacities by 



Cf = sup mini (P; 

p ieA 



(5.34) 



Then, if X > 1 — q(A) and R < C p , then there exists a 
sequence of (n, A) codes with rate R. 



Appendix A: The periodic channel with 
amplitude-damping channel branches 



The qubit amplitude-damping channel acting on the 
a b 
b 1 - a 



state p = 



is given by 



4> 



amp 



(p) = 



a+(l — 0)7 — 7 

b^T^ (i_o)(l- 7 ) 



(1.1) 



The expression for the product-state capacity of the qubit 
amplitude-damping channel is given as follows, 

= S 



V" I Pj (a 3 _+ (1 -a 3 ) l) Pjbjs/ (1 - 7) 
^\ pfoy/O^y) Pi (l- ai )(l- 7 ) 



bjVT^ (1-0,0(1-7) 



(1.2) 
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We now investigate whether the following equation holds 
for a periodic channel with two amplitude-damping chan- 
nel branches 

1 1 1 1 

P sup ^2xi({Pj,Pj}) = , sup Xi({Pj,Pj})- 

(1.3) 

Let 70 and 71 represent the error parameters for two 
amplitude-damping channels 4> and <I>i respectively. We 
have argued |25[ that the Holevo quantity for the qubit 
amplitude-damping channel can be increased using an en- 
semble containing two mirror image pure states each with 
probability h. Using this minimal ensemble we investi- 
gate both sides of Equation (|1.3I) , for a periodic channel 
with two qubit amplitude-damping channel branches. 
Clearly the left hand side of Equation (ll.3[) will be at- 
tained for a single parameter which we denote by a max . 
However, the right hand side of Equation (I1.3P can- 
not be obtained by a single a max . Instead, the supre- 
mum for each channel will be attained at a different 
value of the input state parameter a. We denote by 
(imaxo and a maxi the state parameter that achieves the 
product-state capacity for the channels $0 and $1 re- 
spectively. Let Xo(d) and Xi( a ) denote the Holevo quan- 
tities of the channels $p and $ 1, respectively. Denoting 
Xi = \Jl — 4ji (1 — 7i) (1 — a 2 ) the eigenvalues for each 
of the amplitude-damping channels can be written as 



K mPi ± = - 1 ± Vl - 4 7i(l - 7i)(l - o) 2 ) • (1.4) 



the values for a maxo and a maxi can be determined by 
separately solving the following equation for each channel 

- (1 - 7j)ln 



da 



a + (1 - a)7, ; 

27i(l-7»)(l-a), A + Zt 
In 



■l- i 



1 - Xi 



0. 



(1.5) 



Let X* avg {lo,li, a maxo , a maxi ) denote the average of the 
supremum of the Holevo capacities of the channels $0 
and $1, i.e., 

Xavg (70 ! 7l 1 a maxo ? a maxi ) — ~ \%Q { a maxo ) Xl i. a maxi )) ■ 

(1.6) 

It is not difficult to show that 



a) Taking 70 = 0, the expression X*(70i 7i, fl r. 
comes 



be- 



X*(7o = 0,7i,a maa: ) = H bin (a maxl ) 

+ H bin ((l - a maxl )(l - 71)) 

- S ($1 (/Wax)) ■ (1.9) 



Denoting X* (70,71 , a 



1) b y Xlvaill) 



maxQ 1 Umax 

the right hand side becomes 

+ H bin ((l - a max i)(l - 71)) 

ama.fi 

)). (1.10) 



Clearly, a ma x = |- To show that a m(M < a maxi , 
we must show that 4- J2i Xi{ a ) < at a = a maxi 



For 70 = the Holevo quantity of the channel $0 
becomes 



xo(o)=s(; (1 ° 0) ) -s(p). 



1.11 



But p is a pure state and therefore S(p) = 0. There- 
fore, from Equation (|1.5[) . 



(1.12) 



dxo(a) = ({I -a) 



da 



We have previously shown that the maximising 
state parameter for the amplitude-damping chan- 
nel is achieved at a > \ [25[. We are considering 
the case where 70 7^ 71, i.e. 71 7^ 0, therefore 
amaxi > \- The expression xo( a ) now represents 
the binary entropy, H(a), and is therefore max- 
imised at a — 5. It was shown above that the en- 
tropy S (a) is a strictly concave function for 70 = 
and Xo( a ) is therefore decreasing at a = a maxi . 

The capacity x*(°0 is achieved at a = a rnaxi . 



Therefore 



is equal to zero at this point. 



We can now conclude that ^SiXi( a ) < when 



a = a r 



and therefore 



X*(7o = 0,7i,a TOOa; ) < xlvqijo = 0,71, a 



rnaxo 3 ^niax 



J- 

(1.13) 



X*(70 = l)7l) a mos) =Xa-uo (70 = 1, 7l ; Wo > a ma Xl ) ■ 

Similarly, we can show that 

X*(70,7l = l:«mm) = X*avg{la-,ll = 1) amaxo-, a max 1 )- 

Next, we show separately for a) % = and for b) < 
7i < 1 that the following inequality holds 

X* (70, li,a max ) < Xavqho )• (1-8) 



b) We now show that an inequality exists be- 
tween the expressions x* (70, 7lj a max) and 
xL g (7o,7i,a ma:l;o ,a ma:E1 ) for fixed 70, such that 
< 70 < 1. 

In (2f| we proved that if 70 < 71, then x(7o) > 
x(7i) and therefore a maXQ < a maxi . Therefore, 

^— <C at d — Q"maxi &Fld Q>max ^ ^max\ • 

Similarly, if 70 > 71, then a ma;C o > amasci and 

^— !> at Q — Arnaxi and Clmax ^maxi- 



7 



As a result, a rnax will always lie in between a maxo 
and a maxi . We have previously shown [25| that the 
Holevo quantity for the qubit amplitude-damping 
channel is concave in its single state parameter. 
Therefore a max > where a is the parame- 
ter value associated with xL 9 (7, 7i> «max , dmon), 
i.e. J]j su PaXi( a ) = X7 , 7l (a)- This proves that 



X*(70,7l,a ma2; ) < X*av g (j^li a max ,a maXl ). 

In conclusion, if 70 = 1 or 71 = 1, then a max = a maxo 
or a max = a maxi respectively and X*(7o,7i> a max ) = 
Xlvg (7> li,a maX i, dmaxi)- However, if 70,71 h then 

X* (70 , 71 > a m ax ) < X*avg (% 7l ) «ma! i , a„ la2; t ) . 
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